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PREFACE 


The PDP-15/9 statistics package (STATPAC) is a FORTRAN-coded program used to perform 
statistical analysis on user-supplied data. STATPAC runs under control of the PDP-15 Advanced 
and Background/Foreground Monitor Systems and the PDP-9 Keyboard Monitor System, and 
requires some form of auxiliary bulk storage, such as DECtape or disk. This guide is intended 
to set forth operating procedures for the user, and does not contain detailed descriptions of 
the internal operations of the package. The Guide is organized as follows: 

Chapter 1 Introduction to STATPAC 

Chapter 2 Module Operating Procedures 

Chapter 3 Implementing and Augmenting STATPAC 

Chapter 4 Sample Operation 

Chapter 1 provides general descriptions of each of the modules in the package. Chapter 2 details 
the command dialogue and possible error messages. Chapter 3 contains information related to 
building an executable file and augmenting basic systems either through addition of user software 
modules or through expanding the hardware configuration. Chapter 4 contains sample dialogue 
and output for all STATPAC modules. The Appendix contains detailed algorithms for compu- 
tations performed within the package which will be of interest to the more demanding reader. 
Finally, a bibliography of statistical texts and applicable manuals is included for convenient 
reference. 

No attempt is made within the Guide to educate the novice statistician. It is assumed that the 
user has a good background in statistics and can use the package as a tool to achieve the desired 
results. 
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CHAPTER 1 
INTRODUCTION 

STATPAC is a FORTRAN-coded program used to perform statistical analysis on user-supplied data. The package is 
designed to run under control of PDP-15/9 monitor systems in a hardware configuration that includes 8K of core mem- 
ory, a console Teletype, a high-speed paper tape reader and punch, and two bulk storage units. Due to the limitations of 
8K core memory, the package is divided into logical modules, each of which consists of one or more core loads (i.e., 
chains or overlays). The modules (Figure 1-1) reside on a bulk storage device (logical —4) and include CONTROL, INPUT, 
SMMRY, STPRG, and MLTRG. Basic operation of the package requires that the user supply data to the INPUT module 
which prepares standardized binary data files. The user then can, depending upon his next task, select any one of the 
modules for operation. Briefly, the SMMRY (Summary) module provides the user with a set of descriptive statistics 
based upon his input files. The descriptive statistics include mean, variance, standard deviation, standard error of the 
mean, skewness, kurtosis, maximum, minimum, range, and a correlation matrix. The other two modules (STPRG and 
MLTRG) can be selected to perform stepwise linear regression and multiple linear regression, respectively. 

The following paragraphs provide a general description of each of the modules. User dialogues are presented in Chapter 2 
and detailed algorithms for the internal computations are given in Appendix A. 

1.1 CONTROL MODULE 

The CONTROL module acts as an executive routine, performing miscellaneous control functions while providing a means 
for communications between modules. Initially, the CONTROL module is loaded into core (see Chapter 3). Once 
loaded, it types the message 

*PROG 

The user must respond by typing one of the following names: 

INPUT 

SMMRY 

STPRG 

MLTRG 

EXIT 

By responding with EXIT, the user terminates all processing by STATPAC and control is returned to the monitor. Re- 
sponding with one of the module names causes the corresponding module (or a portion of it) to be loaded from the 
STATPAC tape (logical —4), overlaying the CONTROL module. Control is transferred to the module that has been loaded, 
and it requests and obtains the remaining control parameters required to perform an analysis by conducting a dialogue 
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Figure 1-1 ST ATP AC Logic Modules, Flow Diagram 










with the user (see Chapter 2). When the selected module has completed its task, it requests that the user supply the 
name of the next module to be loaded. If the user requests the module that is already in core, the module again requests 
the required control parameters. This continues until the user requests a different module, at which time the CONTROL 
module is loaded into core and, in turn, loads the selected module. 

1.2 INPUT MODULE 

The INPUT module performs two basic tasks: 

a. Conversion of user-supplied BCD data to binary. 

b. Preparation and storage of the standardized binary data files on a file structured bulk storage device. 

The user’s input data consists of observations, with each observation consisting of a number of variables. For example, 
each person Living in a town could be considered an observation consisting of the variables age, weight, height, etc. One 
can think of a data file, then, as a rectangular array or matrix of the form: 

Variable (1) Variable (2) Variable (L) 


observation 1 

^1,1 

Xi 2 

^l,L 

observation 2 

X2,l 

^2,2 

^2,L 

observation N 

%,1 

%,2 

%,L 


Note: 

i = observation 

^ii tVi 

y j = 1^11 variable 



This data file consists of N observations, with each observation consisting of L variables. One can think of the observa- 
tions as rows and the variables as columns. In using the statistics program, the user will frequently be asked: “What are 
the variables?”, to which he must respond by enumerating the column ordinals of the variables he wants analyzed. In 
brief, given the subscripts 1 , 2, ..., L, where each subscript is associated with one variable, the program is interested in 
how many and which variables were chosen. 

The standardized binary data files are organized on the tape written by the INPUT module as follows: 


Unit 1 has one record which contains L, the number of variables 
in each observation and the names of each variable. 

Unit 2 through K have one record which specifies the number of 
observations within the unit, N; and N records which contain the 
values of the variables for each observation. All units, except pos- 
sibly the have the same number of observations. Unit K may 
have less than N observations. 


The last unit (K+1) has one record which contains 0 to signal the 

j ^^4.^ 

ciiu ui Kiaia.. 
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1.3 SMMRY MODULE 


The SMMRY module reads data files designated by the user, analyzes the data, and outputs the following statistics for 
each variable which was selected by the user for analysis: 

Mean 

Variance 

Standard Deviation 

Standard Error of the Mean 

Skewness 

Kurtosis 

Maximum 

Minimum 

Range 

Correlation Matrix 
1.3.1 SMMRY Statistics 


NORMAL 



The previous statistics are estimates of the corre- 
sponding parameters of the populations from which 
the samples were drawn. The mean serves to specify 
the “center” of the data, while the standard devia- 
tion is a measure of the scatter, or dispersion, of the 
data from the center. The variance is the square of 
the standard deviation. 


STANDARD DEVIATION 



The figure at the left shows the changes in the shape of 
a curve effected by varying the standard deviation, a. 
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SKEWED DISTRIBUTION 


The skewness is used to measure the symmetry of a 
distribution about the mean. Since the normal dis- 
tribution is symmetric, skewness is used to test 
whether a distribution is not normal. 



The sign of the skewness statistic indicates the No Skewness 

direction of the skew as seen at the right. 



Kurtosis measures the relative concentration of 
values of a sample; i.e., about the “center”, 
the “tails”, and the “shoulders” of the distribu- 
tion. The illustration at the right compares 
curves with different degrees of kurtosis. 



The maximum is the highest observed value and the minimum is the lowest observed value. Their difference is the 
range. 

The correlation matrix indicates whether any pairs of variables in a file are highly correlated. Independent variables 
which are too highly correlated should not be used in the same regression analysis problem. 
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The descriptive statistics module (SMMRY) will also enable the user to perform hypotheses testing. 

Table 1-1 

ST ATP AC Symbol Definitions 
Symbol Definition 

L the number of variables in a user data file 



the ordinal of a particular variable in a file (1 <j<L) 

the number of files in an analysis 

the ordinal of a particular file (1 < i< K) 

the number of observations in the i*^^ file 

the observation ordinal (1 < m < N|) 

the variable in the m^^ observation of the i^^ file 

the standard deviation of the variable in the i^^ file 

the variance of the variable in the i^^ file 

*the calculated mean of the variable in the i^^ file 

*the actual mean of the variable in the i^^ file 

the user supplied test mean for the variable 

the user supplied test variance for the variable 

the observed value of the dependent variable 

the predicted value of the dependent variable determined using the 
regression model 

the calculated mean of the observed dependent variable 

th 

the coefficient of the i*^^ variable in a regression model 
the constant term of a regression model. 


*These symbols are used interchangeably in descriptions of hypotheses. 


1.3.2 SMMRY Options 

The SMMRY module of ST ATP AC includes six hypothesis test options. Each option permits the user to test one or 
more actual hypotheses. The user requests a specific option in response to the initial dialogue as described in Chapter 2. 

SMMRY Option 1 allows the user to test hypotheses which relate the calculated means for variables to user-supplied test 
means. These hypothesis tests may be performed upon one or more data files (up to 10 files). The statistic calculated 
for each file, however, is independent of that calculated for any other data file. 
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STATPAC calculates the following t-statistic when option 1 is requested: 

Under the assumption that the sample came from a normal population, the user can use the statistic tj| to test hypotheses 
which relate the calculated mean of a variable (pj-) to the user-supplied test mean for that variable (pj), as summarized 
below. 


Hypothesis 

Acceptance 

Alternative 


Criteria 

Hypothesis 


•^(1 - a/2) (Nj ■ 1) < ^ji < ‘ • ‘2 

Mji+Mj 

V/ 




‘ji>-‘(l-a)(Nj-l) 



When the acceptance criteria is not satisfied at the user-specified significance level (a), the alternate hypothesis is accepted 
The t-values are obtained from a statistical table using the values of Nj - 1 (the degrees of freedom) and the expression in 
a as the parameters for selecting the t-value from the table. 

SMMRY Option 2 allows the user to test hypotheses which relate the variance to a user-supplied test variance. These 
hypothesis tests may be performed upon one or more data files (up to 10 files). The statistic calculated for each file, 
however, is independent of that calculated for any other data file. 

STATPAC calculates the following chi-square statistic when option 2 is requested; 





Assuming a normal population, the xf\ statistic may then be used by the statistician to test hypotheses which relate the 
'y 9 

calculated variance (ajj) with the usef-supplied variance (oj ), as summarized below. 


Hypothesis 

Acceptance 

Alternative 


Criteria 

Hypothesis 

2 2 

2 / 2/ 2 
^ (all) (Nj - 1 ) < Xji < X (1 . cell) (Nj - 1) 

aJ + a: 

2^ 2 

Uji <a. 

2 2 

(1 -a)(Nj- 1) 


a. 2 > 0.2 

Ji J 

X|T>XW(Nj-l) 



When the acceptance criteria is not satisfied at the user-specified significance level (a), the alternate hypothesis is accepted 
The chi-square values are obtained from a statistical table using the values of Nj - 1 (degrees of freedom) and the expres- 
sion in a as the parameters for selecting the chi-square value from the table. 


SMMRY Option 3 allows the user to test hypotheses which relate the mean of a variable in one file to the mean of the 
corresponding variable (i.e., same ordinal) in another file. Thus, at least 2 files must be included in the analysis, but not 
more than 10 files may be analyzed. 
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ST ATP AC calculates the following t-statistics when option 3 is requested by the user: 

where 

s2 = {(N^.l)aj2 + (N^-l)aj2}/{N, + N^.2} 

In the calculation, r and s vary from for each value of j. Results are provided by STATPAC for each value of j 

(i.e., for each variable being analyzed). Thus, for every value of j, there is a K x K matrix generated (where K is the num- 
ber of files in the analysis). 

Under the assumption that the samples came from normal populations with a-; = a-":, the user can perform the following 
hypothesis tests for the variables of each possible pair of files in the analysis using the statistic tj^.^. Each hypothesis test 
relates variables with the same ordinal, but contained in different files. 


Hypothesis 

Acceptance 

Alternate 


Criteria 

Hypothesis 

"jr'Fjs 

•‘(1 - a/2) (Nj + Nj - 2) < ‘jrs ^ ‘(1 - “/2) (N, + - 2) 

Kjr + f-js 


‘jrs<*(l-a)(N, + N^-2) 



-a)(Nj + Nj,-2) 



When the acceptance condition is not satisfied at the user specified significance level (a), the alternate hypothesis is ac- 
cepted. The t-values are obtained from statistical tables using the values of N^. + - 2 (the sum of the separate degrees 

of freedom Nj. - 1 and - 1), and the expression in a as the parameters for selecting the value from the table. 

SMMR Y Option 4 allows the user to test hypotheses which relate the variance of a variable in one file with the variance 
of the corresponding variable (i.e., same ordinal) in a second file. Analysis of at least two files must be performed for 
this option to be executed, but no more than 10 files may be included in the analysis. 




When option 4 is requested, STATPAC computes the following 

where r and s vary from 1 ,2,...,K for each value of j. These F-values are output by STATPAC for each variable in the 
analysis (j, where 1 < j < L) and for all combinations of values for r and s (r, s = 1 ,2,...,K). Thus, for every value of j, 
there is a K x K matrix generated (where K = the number of files in the analysis). 


Under the assumption that the samples were drawn from normal populations, the user can perform the following 
hypothesis tests for a fixed variable and for each pair of files in the analysis, using the computed statistic F- . Each 
hypothesis test relates the variance of a variable with the variance of a variable having the same ordinal, but contained 
in a different file. 
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Hypothesis 

Acceptance 

Alternate 


Criteria 

Hypothesis 

2 2 

V = “js 

P(a/2) (N^ - 1, - 1) < Fjrs < P(1 - a/2) (N^ ■ ■ 1) 

"j? * 


Pjrs<P(l-a)(N^-l,N^-l) 



Pjrs>F(a)(Nj-l.N5-l) 



When the acceptance condition is not satisfied at the user-specified significance level (a), the alternate hypothesis is 
accepted. The F-values are obtained from statistical tables using the values of the degrees of freedom for each file, 

Nj. - 1 and Ng - 1, and the expression in a as the parameters for selecting the F-value from the table. 

SMMRY Option 5 allows the user to test the hypothesis that, for a particular variable, the means of that variable in all 
files of the analysis are equal at the user-specified significance level. Analysis of at least 2 files must be performed for 
this option to be executed, but no more than 10 files may be included. Option 5 is a generalization of option 3. 

When option 5 is requested, STATPAC computes the following F-statistic for each variable j analyzed; 


where 


K 


2Ni(Xji -Xj) 2]/(K-1) 
i = l 




K N, 


Z Z(^jim-V 


i = 1 m =1 


K 

/ 1 (Ni-l) 
i= 1 





9 2 

Under the assumption that all samples were drawn from normal populations with equal variance (i.e., o-f = a-f for all 
r, s = 1,2,...,K) the user may test the following hypothesis: 


Hypothesis 

Acceptance 

Alternate 


Criteria 

Hypothesis 

II 

lO 

II 

II 

Fj<P(l-a)(Vl,V2) 



where 

VI =K - 1 

K 

V2=Z (Nj-l) 

i = l 

for some r and s 


When the acceptance condition is not satisfied at the user-specified significance level (a), the alternate hypothesis is 
accepted. The F-values are obtained from statistical tables using the values of 1 - a, VI , and V2 as the parameters for 
selecting the F-value from the table. 
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SMMRY Option 6 allows the user to perform Bartlett’s test for equal variances for a particular variable in all data files 
in the analysis (normal populations are assumed). Analysis of at least 2 files must be performed for this option to be 
executed, but no more than 10 files may be included. Option 6 is a generalization of option 4. 

When option 6 is requested by the user, STATPAC computes the following chi-square statistic and correction: 




Ki N; 


‘i=l m=l 


K 


log, 0^1 Z Z (Xii„-Xii)"J /Z (Ni-O 


i=l 


K 

Z (Ni-1) 

i=l 


K 


N: 




Z (Ni-l)log,o £ 

i=l 'm=l 1 


C= 1 + 


(K-l)/3 


K 


. , (Ni-1) K 
1 ~1 

Z (Ni-1) 

i= I 


corrected j X^(k . ,) = j x2(K . ,) / C 

where C is the correction factor. Under the assumption that all samples are drawn from normal populations, the user 
may test the following hypothesis: 

Hypothesis Acceptance Alternate 

Criteria Hypothesis 

Ojl^ = Oj2^=- = <'jK jxV-l)<X^(l-a)(K-l) Or^+“s^ 

for some r and s 

2 

where j x may be the corrected or the uncorrected value computed by STATPAC. 

When the acceptance condition is not satisfied at the user specified significance level (a), the alternate hypothesis is ac- 
cepted. The chi-square values are obtained from statistical tables using the values of 1 - a and K - 1 as the parameters 
for selecting the chi-square value from the table. 

For a more complete description of the options, the reader is referred to Chapter 7 of Statistics in Research by Bernard 
Ostle. 


1 A STPRG AND MLTRG MODULES 

STPRG denotes the Stepwise Linear Regression Module and MLTRG denotes the Multiple Linear Regression Module. 
These modules are logically separate, but still have much in common (including a similar algorithm, input/output format, 
and internal organization). Because of their similarities, these modules are described together, with differences clearly 
noted where they exist. 
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1.4.1 Regression Analysis 


Assuming a set of N observations, where each observation consists of L + 1 variables, consider the first L-variabies to be in- 
dependent and denoted by Xj, i = 1,2,...,L. Consider the last variable to be a dependent variable, denoted y. To sum- 
marize, the data will appear as follows: 


^1,1 

Xl,2 

^],3 

... XjL 


^2,1 

^2,2 

^23 . 

. . . ^2,L 

^2 

%,1 

Xn,2 


• • %,L 

Yn 


Now, let us assume that there exists a model (i.e., a rule, relationship, formula, or equation) which defines y as a function 
of the X’s. This model can be expressed in the following manner: 

Y = Bo + B,Xj + B 2X2 + ... + BlXl + E 


where BQ,Bp...,Bj^ are the parameters of the model, and E is a true error to compensate for any discrepancies in the 
model. The task of regression analysis is to estimate or approximate this model, as follows: 

Y = bQ + b,x,+b2X2+... + bLXL + e 


where the b’s are estimates of the parameters and e is the residual of the estimating model. The estimating model is ap- 
plied to all observations in the set of data as follows: 


^ - u 

yi-^o 





+"i 


i=l,2,...,N 


where y- is the estimate of y- in the data, e^ = Yj * is the residual for the i^^ estimate, and N is the number of obser- 
vations. The “goodness” criteria of the estimating model is that the sum of the squares of the residuals must be a mini- 
mum (i.e., least squares). The criteria may be expressed as follows: 


N N 

Y, (yi ■ y^i)^ “ 2 ®i^ “ minimum 
i= 1 i=l 


A regression model with two independent variables is illustrated below. This hyperplane is determined as the best fit for 
the equation y = bQ + bjXj + ^>^ 2^2 ^y ST ATP AC regression analysis. The predicted values ( 5 ^^), the observed values (y-), 
and the residuals (e-) are shown for three observations. 
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Because each b| in the model is raised to the first power and because there are several x’s this analysis is referred to as 
multiple linear regression. At times, one may suspect that some of the independent variables do not significantly con- 
tribute to the prediction quality of the model. In such a case, one can examine the contribution of each variable to the 
model and, using the following criteria, include or exclude variables. To delete a variable, which at this point in time is 
in the model, the increase in the residual variance caused by the elimination of the variable from the model is calculated. 
If the increase is significant according to a user pre-selected level, the variable is deleted from the regression model; 
otherwise it remains in the model. To enter a variable, which at this point in time is not in the model, the decrease in the 
residual variance caused by the inclusion of this variable is calculated. If this decrease is significant, according to a user 
pre-selected level, the variable is added to the regression equation; otherwise it is not added. The technique of examining 
variables individually is denoted stepwise linear regression, (note that stepwise implies multiple variables, hence multiple 
linear regression). 

The output of regression analysis consists of; 

a. Correlation of each independent variable with the dependent variable 

b. For stepwise regression, output is for each step or iteration and includes: 

(1) Variable entering (or leaving) the model 

(2) Sequential F-test which is compared with the user supplied value of F-IN (or F-OUT) to determine 

inclusion (exclusion) of the variable in the model 

(3) The degrees of freedom for this iteration 

(4) R-squared, the multiple correlation coefficient 

(5) The change in multiple correlation from the previous iteration 
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(6) The standard error of the dependent variable 

(7) An analysis of variance table which includes an overall F-value which is used to test the hypothesis 
that all coefficients (except the constant term) are zero 

(8) A table of variables in the regression consisting of the respective coefficients, standard errors, and 
F-values needed to remove these terms from the model 

(9) The constant term of the model (Fq) 

(10) A table of variables not in the regression model and including the tolerance level, partial correlation 
with the dependent variable, and the F-value needed to enter these terms in the model 

c. For multiple Unear regression, the output includes: 

(1) The value of R-squared, the multiple correlation coefficient 

(2) The standard error of y 

(3) An analysis of variance table, as in stepwise regression analysis 

(4) The regression coefficients with their standard errors 

(5) The constant term of the model (bg) 

1.4.2 Regression Options 

The regression analysis modules have four options and four plots which may be requested by the user. The first three 
options allow the user to perform hypothesis tests. The last option is the output of residuals. (Analyses which involve 
many observations require considerable amounts of time to output residuals.) 

Option 1 of STPRG or MLTRG allows the user to test hypotheses Hq : b- = 0 or Hj : b- =5^ 0. ST ATP AC computes the 
value of t- = bj/s.e.(b-) for i = 1,2,.. .,p - 1. If lt-1 < t(N a), Hq is accepted; otherwise, Hj is accepted. The 

hypothesis is accepted or rejected at the 100 (1 - a )% significance (confidence) level. In the expression above, 

N is the number of observations in the data file 

p is defined by the model such that N-p = degrees of freedom (p is the number of coefficients in the model, 

y = bo + biy, + ... + bp.,yp.j) 

a. is the user selected significance level 

Option 1 output consists of the t-value (t-) for each term in the regression model, and the degrees of freedom. 

Option 2 is a generalization of option 1 which tests the hypotheses Hq : b- = bj and Hj : b^ 9^ bj where the value of b- is 
supplied by the user. (Thus option 1 is option 2 with b| = 0.) ST ATP AC computes t- = (b- - bj)/s.e. (b-) for i = 1 ,2,...,p - 1 . 
If lt|K t(N - p,l - 14 a), the hypothesis Hq is accepted at the 100(1 - d)% significance level. Otherwise the hypothesis 
Hj is accepted. The user is asked to supply the values for bj in the command dialogue whenever this option is selected. 
Option 2 output consists of the t-values (t-) for each term in the regression model, the user supplied coefficients from 
question 8 of the dialogue, and the degrees of freedom. 

Option i is a further generalization of options 1 and 2 and tests the hypothesis Hq : L < b- < H, where H and L are 
limits of the confidence interval for each coefficient in the regression model. The confidence interval is computed by 
STATPAC at the 100(1 - a) % confidence level as represented by the following equation: 

b- ±t(N-p,l -14Q!)(s.e.(b-)) 

The value of t(N - p,l - 14 a) is supplied by the user from a t-table, for a statistical value of a, in response to STATPAC 
dialogue. Option 3 output consists of the upper and lower bounds of the confidence intervals, and the degrees of 
freedom. 
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Option 4 is requested by the user when he desires output of residual values (output is on logical 3). Regression analysis 
with many variables will require considerable time to output the residuals. The residual output for each observation in 
the original file includes the observation ordinal, the observed dependent variable (y-), the predicted value of y (y-) by 
the model, and the residual for each observation (Cj = y^ - yj). 

1.4.3 Regression Plots 

Plot 1 is predicted y versus residual for each observation. The plot should appear as a broad horizontal band with no 
trend. A plot which shows an increasing broadness indicates that the variance is not constant and the regression is sus- 
pect. Each element of the grid is a counter with maximum value equal to 9. 

Plot 2 is predicted y versus observed y. The plot indicates the ability of the model to predict the observed y’s. The ideal 
situation would be the straight line, y = y. As in plot 1 , each element of the grid is a counter with maximum value equal 
to 9. 

Plot 3 is residuals versus ordinal. The residuals are plotted versus the ordinal of the observation to check for time factors 
in the model. A non-horizontal plot is an indication that the model may be time dependent. That is, either variance in- 
creases with time or terms in time should have been included in the model. 

Plot 4 is an overall plot of residuals. This histogram of residuals indicates normal distribution of the residuals. 

For further description of the plots and their implications, refer to Chapter 3 Applied Regression Analysis, by Draper 
and Smith (see Bibliography). Options 1, 2 and 3 are also described in Section 1.4 of the same reference. 
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CHAPTER 2 

MODULE OPERATING PROCEDURES 


ST ATP AC is loaded as a compiled and chained FORTRAN program (see Chapter 3 of this manual and the applicable 
Users’ Guide). Once ST ATP AC is loaded, the CONTROL module assumes control. The command dialogue and possible 
error messages for the five modules are described in this chapter. 

NOTES 

1. If the user has assigned the Teletype handler (TTA) to logical 5, he 
may use the RUBOUT key to erase one character to the left for each 
striking of the key. Control U may be typed to delete the whole line. 

These keys can only be used to erase up to the last carriage return. 

2. Characters typed by ST ATP AC are underlined in the following text 
to distinguish them from those typed by the user. 

3. Sample input and dialogue for each module is contained in Chapter 4. 

2.1 CONTROL MODULE 

2.1.1 Command Dialogue 

CONTROL contains one dialogue message which it types on logical 4. 

*PROG 

On logical 5, CONTROL expects one of the following five responses, left justified: 

INPUT 

SMMRY 

STPRG 

MLTRG 

EXIT 

2.1.2 Error Messages 

If the user does not respond with one of the above legal responses, the CONTROL module wiU type: 

♦ERROR 

♦PROG 

CONTROL then awaits the input of another module name. 
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2.2 INPUT MODULE 


2.2.1 Command Dialogue 

The execution of the INPUT module is directed by the following dialogue. The ASCII data file must be on logical 6. 
and the new file will be written on logical 1 . 


Question 1: 

*FILE (OLD) 
AAAAAAAAA 


Question 2; 

*FILE (NEW) 
AAAAA 


Question 3: 

*FORMAT 

(Si,S2....,S„) 


Question 4: 

*NO. OBS. 

mil 


Question 5 : 

*VARS 

VARII= AAAAA 
VARII= AAAAA 


VARII=AAAAA 


Question 6: 

*ZERQ OBS. 
AAA 


INPUT requests the user to identify the name of his data file with its nine character 
name (i.e., six character file name and three-character extension). If the device as- 
signed to logical 6 is not a bulk storage device, this name is of no consequence and 
a carriage return or blank card will suffice. If the device is bulk storage, the nine 
character name may be obtained from the directory listing and must be given 
exactly as given on the listing. 

Question 2 of INPUT requests the name to be associated with the standardized 
binary file which it will create. The user must respond with five characters in A5 
format as described in the FORTRAN Manual. The file will be identified with this 
name for all further analysis by ST ATP AC. If a non-bulk storage device is assigned 
to logical 1 , the name is of no consequence, and a carriage return or blank card will 
suffice. 

The user must respond by typing the format needed to read one observation of his 
BCD (ASCII) data. The number of fields specified in the format must correspond 
to the number of variables per observation (see Question 5). For a discussion of 
format statements, see the FORTRAN Manual. 

INPUT requests the number of observations in the user’s data file. The user responds 
with an integer value of five characters right justified in the five character field. 
(Preceding spaces or Os must be supplied by the user for numbers with less than 
five characters.) 

Question 5 requests the user to specify the names of the variables in his data file. 

He need not specify the names of all of the variables, but he must include the high- 
est variable subscript in the list which may not be greater than 15. The highest sub- 
script given by the user defines the number of variables per observation. The re- 
sponse must be in the form given below: 


Character Position 

Content 

1 through 3 

VAR 

4 and 5 

subscript II (01 < IK 15) 

6 

= 

7 through 1 1 

variable name in A5 


The list of variables is terminated by a blank record (e.g., a simple carriage return 
or blank card). 

The user is asked if he wants those observations which contain at least one 0 variable 
printed. If the user wants these observations, he responds by typing “YES” in A3 
format. Any other response (e.g., “NO”, or a simple carriage return) will suppress 
the typing of such observations. (See Section 2.2.2 for a more complete description.) 
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Comment: 

*O.K. 


INPUT outputs this message to terminate the dialogue, and to indicate that proces- 
sing will now begin. No response is necessary. 


NOTE 

Once a user BCD data file has been written in standardized binary format 
on logical 1 by INPUT, the user need not generate the binary file again. 


2.2.2 Error Messages 

If the user types an illegal subscript (greater than 1 5) to Question 5 (*VARS), INPUT wdl type the following message 
and will not accept the line which was typed. 

*IGNORED 

The user may continue with legal responses to the question. 

If a subscript less than 1 is typed, the input list is terminated and the following question will be asked (ZERO OBS.). 

If a bulk storage device is assigned to logical 6 and the user does not respond with the exact name of a file in response 
to Question 1 (*FILE (OLD)), lOPS 13 will result.^ No recovery is possible except by restarting execution of STATPAC. 
Restarting is accomplished by typing CNTRL/C after lOPS 13. The monitor from which execution may be restarted 
will be loaded. 

If a variable of a particular observation does not conform to the user-specified format (answer to the question *FORMAT), 
the variable in question may be recorded as 0 in the standardized file created by INPUT. The question “ZERO OBS.” 
allows the user to monitor 0 values for such losses. If the user requests such output, INPUT will type the entire obser- 
vation on logical 4 together with the observation ordinal. The option cannot, however, distinguish between valid and 
assigned 0 values. 

NOTE 

1. Only five characters (A5) for a file name are supplied by the user in 
Question 2. STATPAC supplies lj STP as the remaining four characters 
of the file name in the bulk storage directory. 

2. All variables in BCD data on logical 6 must be real, i.e., E-, F- or 
G-type conversion. 

2.3 DESCRIPTIVE STATISTICS MODULE 

2.3.1 Command Dialogue 

Question 0: 

*PROG 
SMMRY 

Question 1: 

*FILE 
AAAAA 
AAAAA 

AAAAA 

1 » 

Consult the Users’ Guide of your computer system for a description of the lOPS 13 error. 


The module currently in command requests the user to specify the next module 
to be used. Assume the user answers with the name of the descriptive statistics 
module, SMMRY. 

The user is asked to specify the files to be analyzed by the SMMRY module. The 
response consists of the names assigned to the files during execution of the INPUT 
module (INPUT Question 2). Response must be in A5 format, left justified. As 
many as ten files may be analyzed at one time. The user terminates the list of file 
names by supplying a blank record (e.g., a simple carriage return or a blank card). 


2-3 



Question 2: 
*VARS 
II 
II 


II 

) 

Question 3 : 
*QPTS 
IIIIII 


Question 4: 

*MEAN 

MENII=±XXX.XXXX 

MENII=±XXX.XXXX 


MENII=±XXX.XXXX 


Question 5 : 

*VRNC 

VARII=±XXX.XXXX 

VARII=±XXX.XXXX 


VARiI=±XXX.XXXX 


The user is asked to list the subscripts of the variables to be analyzed. The response 
is in 12 format, right justified. At most, 15 subscripts may be listed, and the values 
of II must be 01 < II < 15. The list of subscripts is terminated by a blank record. 


The user is requested to indicate which, if any, hypothesis tests are desired. The 
user responds with a 0 if he does not want a specific option and with a 1 (or any 
positive integer) if he does want an option. The first position represents option 1 , 
the second position represents option 2, etc. If no options are desired, the user may 
respond with a blank record. Examples: 

001101 User requests options 3, 4, and 6. 

100000 User requests option 1 only. 

If option 1 is requested in response to Question 3, the user is requested to provide 
a test mean for each variable in the analysis. The user must respond in the following 
form. 


Character Position 

Content 

1 through 3 

MEN 

4 through 5 

subscript II (01 < II < 15) 

6 

= 

7 through 15 

test mean in F9.6 


The list is terminated by a blank record. An entry for a subscript may be retyped 
and only the last appearance will be used. If a variable which is in the analysis by 
virtue of being listed in Question 2 is not assigned a test mean, a default test mean 
of 0 is assumed. 

The user who has requested option 2 in Question 3 is requested to provide a test 
variance for each variable in the analysis. The user must respond in the following 
form. 


Character Position 

Content 

1 through 3 

VAR 

4 through 5 

subscript II (01 < II < 15) 

6 

= 

7 through 15 

test variance in F9.5 


24 





The list is terminated by a blank record. An entry for a subscript may be retyped 
after it has already been entered and only the last appearance will be used. If a 
variable in the analysis is not assigned a test mean, because it was listed in Question 2 
a default test mean of 0 is assumed by STATPAC. 

Comment; 

*Q.K 

2.3.2 Error Messages 

If the user responds with a subscript greater than 15 in response to Questions 2, 4, or 5, that particular line is completely 
ignored but no message is typed. If the user responds with a non-positive subscript to one of these same questions, this 
is treated as a list terminator and the dialogue proceeds to the next question. 

If the user responds to Question 2 (*VARS) with a subscript which is greater than the number of variables/observations 
in some data file which he has listed in Question 1 (*FILE), the message 

*ERR1 

will be output on logical 4. Note that the number of variables per observation is defined by the user’s response to 
Question 5 of INPUT. When this error condition exists, the file in question is excluded from further analysis and pro- 
cessing continues. If all the user listed files from Question 1 are eliminated from analysis, the following message will be 
output on logical 4: 

*ERR2 

If the user requests option 3, 4, 5, or 6 and has listed only one file name for analysis in response to Question 1, these 
options will not be processed and no output will result, since they are meaningless for only one file. 

NOTES 

1 . When the elements of the correlation matrix are being calculated, the 
terms 

N N 

m=l m=l 

are used in the denominator of the expression for calculating c-j. If both 
of these terms are not larger than TOL = .lE-9, then c-- is given the de- 
fault value 2.0. 

2. If, during the processing of option 2, a user supplied variance is found 
to be less than or equal to TOL = .lE-9, the corresponding statistic is given 
the default value of 1 .E76. Similarly, if during the processing of option 1 , 

a standard deviation is calculated from the data file which is less than .TOL, 
the corresponding statistic is assigned the default value .1E76. Option 4 
operates similarly. 

2.4 REGRESSION ANALYSIS MODULES 
2.4.1 Command Dialogue 

Question 0: The module currently in command types this question. The user is assumed to have 

*PROG typed either STPRG or MLTRG. 

STPRG 

MLTRG 


STATPAC indicates the termination of dialogue and the start of the requested 
analysis by typing the comment O.K. 
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Question 1 : 
* FILE 
AAAAA 

Question 2: 
* VARS 
II 
II 
II 


II 

Question 3 : 

*FIN 

xxxx.xxxx 


Question 4: 

*FOUT 

XXXX.XXXX 

Question 5: 

* LIM 

II 


Question 6: 

* TOL 

XXXX.XXXX 


Question 7; 
♦OPTS 
IIII 


STPRG or MLTRG requests the name of the data file to be analyzed. The user 
responds in A5 with the exact name of the file which was given in answer to INPUT 
Question 2. 

STPRG or MLTRG requests the subscripts of the variables to be analyzed. The user 
must respond in 12, followed by a carriage return, and right justified in the two char- 
acter field. Values must be 01 < II < 15. The last subscript of the list will be con- 
sidered the dependent variable. Each subscript is terminated by a carriage return, 
and the list is terminated by a blank record (e.g., an extra carriage return or blank 
card). If all variables of the file are to be analyzed, the user need only type the sub- 
script of the dependent variable. 


STPRG requests the F-value which will be used to determine if a variable not in the 
model makes a significant contribution to the model and, therefore, should be added. 
The user responds in F9.5 followed by a carriage return. 


NOTE 

This question is asked only when the user response to Question 0 is STPRG. 


Question 4 of STPRG requests the F-value to determine if the contribution of a vari- 
able, which is in the estimating model, is insignificant and should therefore be ex- 
cluded from the model. Response is in F9. 5. See NOTE for Question 3. 


Question 5 of STPRG requests the user to specify the number of iterations or cycles 
to be allowed in the calculation of the estimating model. The limit prevents STPRG 
from getting into a nonproductive loop of successively including and excluding vari- 
ables in the model. If the user responds with other than a positive integer, STPRG 
will use a default limit equal to twice the number of independent variables being 
analyzed. See NOTE for Question 3. 

Question 6 requests the tolerance factor used by STPRG and MLTRG. The tolerance 
is used to check for constant observations and to check the diagonal elements of the 
correlation matrix to avoid trying to invert a badly behaved matrix. Values for TOL 
are usually between 0.001 and 0.0001. If the user responds with a blank record, 

ST ATP AC uses a default value of TOL = .001 . 


STPRG or MLTRG requests the user to specify which options are desired. The four 
options are described in detail in Section 1.4.2. The user responds w’ith a 0 if he does 
not want an option or with a 1 if he does want the option. Examples: 


1000 

Option 1 only 

1010 

Options 1 and 3 only 

0011 

Options 3 and 4 only 
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Question 8: 

* COEF 

COFII=XXXX.XXXX 

COFII=XXXX.XXXX 


COFII=XXXX.XXXX 


Question 9: 

* FCTR 

XXXX.XXXX 


Question 10; 
*PLTS 

mi 


Comment: 

*O.K. 


This question is asked only if the user requests option 2 in Question 7 (i.e., he types 1 
in the second position of the response to Question 7). The user must respond as out- 
lined below. 


Character Position 

Content 

1 through 3 

COF 

4 and 5 

subscript II (01 < II < 15) 

6 

= 

7 through 1 5 

test coefficient in F9.5 


The list is terminated by a blank record. 

This question is asked only if the user requests option 3 in Question 7 (i.e., he types 
a 1 in the third position of the response to Question 7). The user must respond with 
the value of t(N - p, 1 - a) in F9.5. The t-value is obtained from a table by esti- 
mating the degrees of freedom (N-p) and specifying a confidence level. The actual 
degrees of freedom are output by the regression module and may be checked against 
the estimated value used to obtain the response. 

The user is requested to specify the output plots which he would like. The plots are 
described in Section 1.4.3, and are listed below. The user types a 1 in the position 
corresponding to those plots he wants and a 0 in the positions corresponding to plots 
he does not want. Examples: 

1000 Plot 1 only 

0010 Plot 3 only 

1011 Plot 1,3, and 4 

The regression analysis modules type this message to indicate termination of the 
dialogue and start of the processing. 


NOTE 

An “A” appears in plotted output when the value to be plotted is a coun- 
ter which exceeds 9. 


2.4.2 Error Messages 

If the user responds to Question 2 (*VARS) or Question 8 (*COEF) with a subscript value greater than 15, the line in 
question will be ignored. If a non-positive subscript is typed, the list in question is terminated and STATPAC types the 
next question in the dialogue. 

If the following expression is less than the user-supplied tolerance for a specific variable with subscript j : 

U H ^ < tol 

where N is the number of observations, the variable is considered constant by STATPAC and the following error mes- 
sage is output on logical 4; 

*ERR 1 j 
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(The value j is the subscript of the variable which caused the error.) ERR 1 will terminate processing and a new dialogue 
will begin. 


If the MLTRG module is being used and not all the independent variables can be entered (due to the choice of TOL in 
part) the following error message will be output on logical 4 : 

*ERR2 

ERR 2 will terminate processing and ST ATP AC will begin a new dialogue. 

If the user has responded with a subscript greater than the number of variables/observation, STATPAC outputs the fol- 
lowing error message: 

*ERR3 

ERR 3 will terminate processing and cause STATPAC to begin a new dialogue. Note that the number of variables per 
observation is defined by the user’s response to Question 5 of INPUT. 

When the STPRG module is being used, a limit factor (LIM) is used to limit the number of passes in the stepping algorithm. 
Exceeding this limit causes the following error message on logical 4: 

*ERR4 

ERR 4 terminates processing and STATPAC begins a new dialogue. 

If no variables are entered into the regression model when using either the MLTRG or the STPRG modules, the following 
message is output on logical 4: 

*ERR5 

ERR 5 terminates processing and STATPAC begins a new dialogue. 

NOTE 

When any of the above error conditions occur, the user should not in- 
discriminately adjust the values of TOL, FIN, LIM, etc., to force a com- 
plete analysis. The user should closely examine the variables and limits 
involved before making any such adjustments. 
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CHAPTER 3 

IMPLEMENTING AND AUGMENTING STATPAC 

This chapter describes the procedure to be followed when building a STATPAC executable file using the PDP-15 and 
PDP-9 Monitors. Refer to CHAIN and EXECUTE of the Users’ Guide for the general description for chaining the 
STATPAC program. The procedure for building an executable file with specific hardware and handler assignments is 
described later in this chapter. 

STATPAC modules make use of the following logical units. Specific device handlers must be assigned to these units when 
the executable file is built. 

Logical Unit Function within STATPAC 

4 Contains STATPAC in executable form in lOPS binary. 

1 Contains standardized binary data files written by the INPUT module 
in lOPS binary. 

2 Stores temporary files during processing by a STATPAC module in 
lOPS binary. 

3 Hard copy statistical output in lOPS ASCII. 

4 Program queries and error messages in lOPS ASCII. 

5 User responses to queries in lOPS ASCII. 

6 User suppHed BCD data files as input to the INPUT module in lOPS 
ASCII. 

7 Temporary storage of residuals (regression) and temporary storage of 
option output for SMMRY in lOPS binary. 

If the user assigns a bulk storage device to logical unit 3, the output of STATPAC will be recorded in files with the fol- 
lowing file names; 

Descriptive Statistics 

SMMRY STP Contains the standard SMMRY module output 

OFTON STP Contains the output of the options for SMMRY 
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Regression Analysis 


REGRS STP Contains the standard regression output (for each step in the case of STPRG) 

and the residual output, if requested 

OPTPL STP Contains the output for regression options 1, 2, and 3 and the plotted output, 

if requested 

3 . 1 BUILDING AN EXECUTABLE FILE 

ST ATP AC is supplied to the user in three forms: 

a. Source files for each ST ATP AC chain. 

b. Binary files of the FORTRAN compiled STATPAC source for each chain. 

c. An executable file with fixed handler assignments, chained as described in this section. 


The files are organized according to the following chart. 


Module 

Source File 

Binary File 

CONTROL 

CHOI 

SRC 

CHOI 

BIN 

INPUT 

CH03 

SRC 

CH03 

BIN 

SMMRY 

CH06 

SRC 

CH06 

BIN 

SMMRY 

CH07 

SRC 

CH07 

BIN 

SMMRY 

CH08 

SRC 

CH08 

BIN 

SMMRY 

CH09 

SRC 

CH09 

BIN 

STPRG & 

CHIO 

SRC 

CHIO 

BIN 

MLTRG 





STPRG & 

CHll 

SRC 

CHll 

BIN 

MLTRG 





STPRG & 

CH12 

SRC 

CH12 

BIN 

MLTRG 





STPRG & 

CH13 

SRC 

CH13 

BIN 

MLTRG 






NOTE 


16K of core memory is required to build the STATPAC executable file, 
although it will operate in an 8K memory. 

An executable file is produced from the above chains by following the steps outlined below. The description assumes 
that the user has two DECtapes as bulk storage devices for the Monitor System. Assuming also that the compiled 
binary files are on DECtape unit 1 ; the user should make the following handler assignments: 

$A DTAO -1 
$A DTAl -4,-6 
$ADTB1 1,2,6 ,7 
$A TTA 3,4,5 

epur A TXT 
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Once CHAIN has been loaded from the system tape, the message “CHAIN V2A” is typed as shown below. The user then 
types the command “BUILD STATPC”, thereby identifying the executable file to be buht. The user then types aU re- 
sponses which are preceded by a > in the following listing. (Lines preceded by a > indicate user response is required. 
Lines not preceded by a > are typed by CHAIN.) 


CHAIN V2A 


>BUILD 

STATPC 

>C 1 


>CH01 


>END 


CH01 

36655 

BCDI 0 

33662 

.ss 

33603 

GOTO 

33555 

STOP 

33542 

SPMSG 

33447 

FIOPS 

32713 

OTSER 

3261 7 

INTEGE 

32467 

REAL 

31 466 

CHAIN# 

1 

LOWEST 

31466 

COMSZE 

000 10 


>C 3 

>CH03 

>END 


CH0 3 

34672 

DTB. 

32647 

FILE 

31 325 

.DA 

31256 

BCDIO 

26263 

BINIC 

2 60 1 2 

• SS 

25733 

FIOPS 

251 77 

OTSER 

25103 

I NTEGE 

24753 

REAL 

23752 

CHAIN# 

3 

LOWEST 

23752 

COMSZE 

00010 


>C 6 
>CH0 6 
>END 


CH0 6 

35077 

DTB. 

32054 

FILE 

31532 

.DA 

31463 

BCDIO 

26470 

BINIO 

2621 7 

.SS 

26140 

GOTO 

261 12 

FIOPS 

25356 

OTSER 

25262 

INTEGE 

251 32 

REAL 

241 31 
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CHAIN# 

6 

LOWEST 

24131 

COMSZE 

00167 

•C 7 
>CH0 7 
‘END 

CH0 6 

33571 

DTB. 

30546 

FILE 

30224 

FLOAT 

30213 

SORT 

30125 

.DA 

300 56 

BINIO 

27605 

.ss 

27526 

FIOPS 

26772 

OTSER 

26676 

INTEGE 

26546 

REAL 

25545 

CHAIN# 

7 

LOWEST 

22545 

COMSZE 

00167 


>C 8 

>CH08 

>END 


CH08 

33620 

DTB. 

30575 

FILE 

30253 

ABS 

30235 

FLOAT 

30224 

SORT 

30136 

ALOG10 

30116 

.EE 

30025 

.EC 

27761 

.DA 

27712 

BINIO 

27441 

.SS 

27362 

GOTO 

27334 

FIOPS 

26600 

OTSER 

26504 

INTEGE 

26354 

REAL 

25353 

CHAIN# 

10 

LOWEST 

25353 

COMSZE 

00167 


>C 9 

>CH09 

>END 


CH09 

3551 1 

DTB. 

32466 

FILE 

32144 

.DA 

32075 

BCDIO 

27102 

BINIO 

26631 

.SS 

26552 

GOTO 

26524 

FIOPS 

25770 

OTSER 

25674 

INTEGE 

25544 

REAL 

24543 



CHAIN# 

1 1 

LOWEST 

24543 

COMSZE 

00 1 67 

•C 10 


■CH10 


>END 


CH10 

34420 

DTB. 

31375 

FILE 

31053 

• DA 

31004 

BCDI 0 

260 1 1 

BINIO 

25540 

.ss 

25461 

GOTO 

25433 

FIOPS 

24677 

OTSER 

24603 

INTEGE 

24453 

REAL 

23452 

CHAI N# 

1 2 

LOWEST 

23452 

COMSZE 

00125 


>C 1 1 
>CH1 1 
>END 


CHI 1 

31553 

DTB. 

26530 

FILE 

26206 

ABS 

26170 

lABS 

26154 

FLOAT 

26143 

SORT 

26055 

.DA 

26006 

BINIO 

25535 

.SS 

25456 

FIOPS 

24722 

OTSER 

24626 

I NTEGE 

24476 

REAL 

23475 

CHAIN# 

1 3 

LOWEST 

23475 

COMSZE 

00273 


>C 12 
>CH1 2 
>END 


CHI 2 

33003 

DTB. 

27760 

FILE 

27436 

ABS 

27420 

.DA 

27351 

BINIO 

27100 

.SS 

270 1 2 

GOTO 

267 73 

FIOPS 

26237 

OTSER 

261 43 

INTEGE 

260 1 3 

REAL 

2 50 1 2 



CHAIN# 14 
LOWEST 25012 
COMSZE 00273 


>C 13 


>CH1 3 


>END 


CHI 3 

3521 7 

DTB. 

321 74 

FILE 

31652 

• DA 

31603 

BCDIO 

26610 

BINIO 

26337 

• SS 

26260 

GOTO 

26232 

FIOPS 

25476 

OTSER 

25402 

INTEGE 

25252 

REAL 

24251 

CHAIN# 

1 5 

LOWEST 

24251 

COMSZE 

00025 


>CLOSE 


CHAiig V2A 
>EXIT 


MOiOITOL yjA-'^ 


NOTE 

Chain numbers are typed by the user in decimal, but CHAIN prints the 
chain numbers in octal. 

When the message “CHAIN V2A” is typed at the end of the listing, STATPC is on DECtape 1 in executable format. 
After calling the MONITOR, STATPC is executed by typing: 

A DTCl 4 
E STATPC 


NOTE 

STATPC XCT is the name of the executable file which is stored on 
logical unit 1 by CHAIN. 

The assignments made to build the executable file described in this chapter result in all files (ASCII, binary, or tempo- 
rary) being stored on DECtape 1, and all hard copy output being on the Teletype unit, which is also used for the dia- 
logue of the modules. 

The user can increase the processing and output speed by optimally assigning peripheral handlers to the STATPAC log- 
ical units. 
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3.2 ADDING PROCESSING MODULES TO STATPAC 


Li.1V iiOVl iiCtO i'liV <XKJXXX\.y 


pl-ktl-i-I-17 +/-\ r^AA AM t^r\AAA m/^Huloo -f/^ QT A ' I A K\r -fAllrwiyirirT o •fo\i7 c-imr^lo /^/^rnmn+iz-^nc 


JL X. X.‘\~£~XJ.'\ WMVSpI Xkl\JSJ.^XXSyO C^V/ 


OiiiiMiV Wli T Vlil.lV^lii3« Jl 


example, assume the user wishes to add a module which is written in one chain to STATPAC. The name of the new 
module is ABCDE. The chain must include the following statements: 


COMMON ICNTRL, CPBLTY, IFLAG, ... 

T\ A <T^ A A 'n/^TM^ /OTT A ri^T^T? / 

i^Ai A Ai3i^i7ii/bnAD^L)i:// 


C PROCESSING BEGINS HERE 
100 CONTINUE 

C PROCESSING IS FINISHED 

200 WRITE (4,201) 

201 FORMAT (6H*PROG) 

READ (5,202) CPBLTY 

202 FORMAT (A5) 

IF (CPBLTY.EQ.ABCDE) GO TO 100 
C DIFFERENT MODULE REQUESTED, CALL CONTROL MODULE 
CALL CHAIN (1) 

END 

If the newly added module must read a data file which was written by INPUT, the file (on logical- 1) could be read with 
the following coding: 


C READ AAAAA, WHICH WAS GIVEN BY USER IN QUEST. 2 OF INPUT. 
READ (5,100) FILE (1) 

100 FORMAT (A5) 

C FILE (2)=4H STP 

CALL SEEK (1, FILE) 

READ (1) L, (NAME(I),I=1,L) 

C INITIALIZE OBSERVATION COUNTER 
N=0 

103 READ (1) NO 

IF (N0.EQ.0) GO TO 102 
N=N+N0 

DO 101 N01=1,N0 
READ(1)(X(I),I=1,L) 
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101 CONTINUE 
GO TO 103 

C ALL OBSERVATIONS READ 
C N TOTAL NUMBER OF OBSERVATIONS 

102 CALL CLOSE (1) 


The format of the files formed by INPUT which must be read by a user written module for analysis is given in 

Section 1.2. This format is summarized below. The file is stored on a bulk storage device with directory entry AAAAA 
STP where the name AAAAA is supplied by the user in response to Question 2 of INPUT and the STP is automatically 
supplied by STATPAC. In the following description, L is the number of variables (the highest acceptable subscript sup- 
plied in Question 5 of INPUT). 

[L, NAME(l),.. 

.,NAME(L) ] 

Contains the number of variables per observation and their 
respective names 

[NO] 


NO is the number of observations which follow 

[Xi,X2,...Xl] 


Contains one observation 

[XpX2,...,XL] 


Contains one observation 

[NO] 


NO is the number of observations which follow 

[Xi,X2,...Xl] 


Contains one observation 

[XpX2,...,XL] 


Contains one observation 

[0] 


Contains 0 to indicate that zero observations follow, i.e., the 
end of the data file 

The entries in COMMON which must be made in the user supplied module are the following: 

ICNTRL 

Used by the CONTROL module 

CPBLTY 

Used if the user requests a different module once processing is completed 
by the module presently in core (used to read the answer to “*PROG”) 

IFLAG 

May be used in the user supplied program to define multiple entry points 
into a chain if the module occupies more than one chain (set to 1 by 

CONTROL) 


When the user adds a module to STATPAC, the CONTROL module must be modified to allow the new module to be 
called. The following changes must be made to CONTROL, assuming that the new module is named ABCDE and that 
chain number 20 is assigned to it when the executable file is built. (The CONTROL module is chain 1 .) 

a. Increase the dimension of TABLE by 1 (i.e., for the first addition, TABLE (6) is the correct dimension). 

b. Add the following DATA statement to the CONTROL module: 

DATA ABCDE/5HABCDE/ 
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c. Increase the value of MODULS by 1 (i.e., for the first addition, MODULS/5/ is changed to MODULS/6/). 

d. The module selector statement “GOTO (101,102,103,104,105),!” should be changed to add the state- 
ment number of a CALL CHAIN command. For the first addition, the GOTO statement is changed to: 

GOTO (101,102,103,104,105,106),! 

and the following statement is added after statement 105: 

106 CALL CHAIN (20) 
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CHAPTER 4 
SAMPLE OPERATION 

The user dialogue with STATPAC and the possible data output are illustrated in this chapter. Each of the options and 
plots which may be requested in the modules is included in the output. The responses to the initial dialogue are for 
illustrative purposes only, and are not intended as examples of statistically meaningful responses. 

The user is referred to the following books for more complete descriptions of the statistical applications of the various 
options and plots. The selection of tolerances and test means, variances, etc., is discussed in these references. 

Descriptive Statistics: Statistics in Research 

by Bernard Ostle 
Chapter 7 

Quality Control and Industrial Statistics 

by Acheson J. Duncan, PH. D. 

Chapter 4 

Regression Analysis: Applied Regression Analysis 

by N. R. Draper and H. Smith 
Chapters 1,2, 3 ,4, and 6 

BMD Biomedical Computer Programs 

edited by W. J. Dixon 
Pages 233-257 

Mathematical Methods for Digital Computers 

edited by Anthony Ralston, PH. D. 
and Herbert S. Wilf, Ph. D. 

Chapter 17 

The operation of the CONTROL module is not illustrated explicitly since the only question used specifies the analysis 
module desired by the user. 

The data used to illustrate the STATPAC modules was obtained horn BMD Biomedical Computer Pro^ams, published 
by the University of California Press, used with permission of the editor, Mr. W. J. Dixon. 
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4.1 INPUT EXAMPLE 


The following Teletype listing is an example of the dialogue for the INPUT module. Any printed line which is started 
with an asterisk (*) is typed by STATPAC; all other lines are typed by the user. Comments are added to the listing to 
aid the reader. 

The message 

***WARNING - COMMON SIZE DIFFERS*** 

is often typed during the execution of STATPAC. This message is typed by CHAIN and should be ignored by the 
STATPAC user. 

Following the INPUT dialogue, a complete listing of the BMD data is provided to illustrate the format used. This data 
(its source was credited at the beginning of this chapter) is analyzed by the STATPAC modules and is used throughout 
this chapter to illustrate the operation of the various modules. 


SE STATPC 


♦PROG 

INPUT 


♦FILE (OLD) 


BMD ^ SRC 
♦FILE (NEW) 

BMD 

♦ FORMAT^- 

( F7 . 2:.(7^7 . 0 , 2F7 . 2> 2F 7 . 0 ) 
♦N0 . OBS . 

00068 

♦ VARS 

VAR0 1 =ONEBM 
VAR02=TWOBM 
VAR03=TREBM 
VAR04=ORTBM 
VAR06=SIXBM 
VAR05=FI VBM 

< 

♦ZERO OBS. 


3 spaces 


note use of RUBOUT key 


blank record (i.e., simple carriage return) 


♦0 .K. 
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O 5 

'■' 1 1 3m 
m33P\ 
P!0 1 7 “5 
<3(3 3 <3 3 
33333 
0055'^ 
00600 
00 1 30 
00 500 
00500 
00300 
00P00 
00200 
00 100 
Pj ^ 0^ 

000 50 
00025 
0 1 400 
00250 
00350 
00350 
00250 
000 50 
00200 
00700 
00400 
0 1 500 
00100 
00350 
0 1300 
00200 
0 1 200 
00400 
00300 
00R00 
00 900 
00600 

00 R00 

001 50 
00700 
00R00 
00200 
00600 
0 1 500 
0 1 700 
0 1 600 
00300 
00600 
0 1 400 
00 600 
00 1 R0 
0 1500 
0 1 800 
00 500 
03000 

02 900 
00 1 R0 
0 1 300 
0 1 900 
0 1100 
01000 
00600 
00500 
00 100 
0 1 700 
00500 
00130 


00 0 0 5 
00021 
00022 
0 0 0 0 9 

0 00 23 
000 10 
0000 7 
00006 
0000R 
0 00 18 
00003 

0 0 0 0 R 
0000 6 
00008 
00022 
000 1 3 
00026 
00023 

00003 

000 1 5 
00028 
000 0 6 
000 3 5 
000 1 1 
000 1 1 
00032 
00008 
00023 
00038 
000 1 5 
0000 6 

00025 
00005 

000 0 9 

00007 
00020 
0000 6 
000 12 

00026 
000 1 5 
000 1 0 
0002R 
00034 

00004 
00032 
000 1 1 
00002 
000 1 R 
00003 
0000R 
000 1 4 
000 1 2 
00003 
0000 6 
000 12 
000 1 1 
0000 R 
0002 4 
00026 

00029 
000 1 7 
000 1 5 
000 1 0 
00022 
000 1 5 
0000 9 

00030 
000 1 0 


00500 
02100 
02200 
0 0 130 
02300 
00060 
00 1 40 
00 0 R0 
002 70 
00360 
00 1 00 

002 70 
00300 
00 100. 
02200 
0 1 300 
00 1 20 
02300 
00 1 00 
00250 
0 1 400 
00060 

03 500 
0 0 200 
01100 
03200 
00 100 
02300 
03800 
00 500 
00 120 
02500 
00 1 70 
00075 
00350 
02000 
000R6 
00400 
00 1 60 
00 300 
00090 
02800 
03400 
000R0 
03200 
0 1100 
000 50 
00 1 60 
00040 
00110 
00090 
00240 
00 1 50 
00550 
0 0 20 0 
0 1100 
00R00 
02400 
02 6710 
02 900 
0 1 700 
00 500 
00500 
02200 
00 500 
00300 
0 3 500 
00 1 30 


00 150 

0 0 0 9 7 

00043 

0 0 1 R 0 
00200 
00330 
00340 

00500 

00 1 50 
0 0 1 P0 
00 1 40 
00 100 
00 1 50 
00250 
00110 
002 80 
000 73 
000 1 0 
00350 
00028 
0000 1 
00500 

00570 
00 3 40 
000 50 
00 6 60 
00450 
000 1 5 
00220 
00 1 50 
003 70 
00 1 00 
00030 

00 1 90 
00260 
00220 
00250 
00 120 
00110 
00 1 60 
01000 
00420 
00090 
00360 
00 1 R0 
002 30 
00 1 80 
00110 
00 1 30 
00200 
000 70 
00 1 50 
000R0 
00570 
00410 
00200 
00 100 
00110 
00 1 70 
0 4800 
00 1 60 
00350 
00 1 00 
00 1 20 
00080 
0 1 300 
00090 
00900 


0 0 0 3 4 
0 0 0 3 6 
0 0 0 4 1 
0 0 0 1 5 
00033 
000 1 3 
000 1 6 
000 1 1 
000 19 
0 0 0 P 7 
000 1 4 
0002 5 
0002 1 
000 1 8 
0004 6 
0 0 0 1 7 

00048 

00036 
0000*5 
00033 
00 0 4 6 
000 1 0 
00038 
000 1 6 
00020 
00038 
000 1 2 

00049 
00043 
00033 
00009 
00035 
0002 1 

000 1 7 

0001 2 
00030 
000 15 
00020 
00035 
00029 
000 1 2 
00040 
00042 
000 1 1 
000 44 
000 1 4 
000 1 1 
00032 
000 1 5 
00017 
00029 
0002 1 
000 1 3 
00009 
000 1 6 
0 0 022 
00022 
00038 

00038 
00029 
0002 5 
000 1 9 
00026 

00039 
00029 
000 1 0 
00058 
0 0 0 10 


000 64 
000 65 
00082 
0 0 0 23 

00064 
000 1 6 
000 ] 0 
00027 
00048 

00050 
000 1 2 
000 1 3 
00020 
00023 
00118 
000 50 
00063 
00 1,50 
000 72 
00054 
00 109 
000 10 
00 125 
00044 
00048 
00 105 
0000 9 
00 1 30 
00 1 60 
000 48 
00036 
00 1 50 
000 78 
00023 
00042 
000 72 
00020 
00036 
00056 
00036 
00026 
00 108 
00 106 
000 1 6 
00104 
00047 

00027 
000 1 2 
00007 
000 1 8 
00 028 
00025 
000 1 1 
00020 
000 1 4 

0003 R 

00 103 
00 106 
00063 
00208 
000 32 

00028 
0 0032 
00 100 
000 50 
00080 

00065 
00025 
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4.2 SMMRY EXAMPLE 


The following statistical output is the result of analysis by SMMRY on the previously presented BMD data and two 
arbitrary files, DATAl and DATA2. The arbitrary data files are included to allow the user to select all six options of 
SMMRY for demonstration purposes. Thus the complete printed output is presented with options. 


♦PROG 

SMMRY 

♦ ♦♦WARNING - COMMON SIZE DIFFERS^^+ This message should be ignored. 
♦♦♦WARNING - COMMON SIZE DIFFERS^^^ 


♦FILE 


User requests analysis of three data ffles • BMD, DATAl , and DATA2. 


BMD 

DATAl 

DATA2 


♦VARS 

01 

04 

05 
02 
03 


♦ OPTS 
221355 

♦ MEAN 
MEN01 =1.6 
MEN02 = 1 .7 
MEN03 = 1 .8 
MEN04=2 .3 
MEN05=2 .5 


User types the variables to be analyzed - no specific order necessary. 


User requests all six options - any nonzero digit may be used to request 
an option. 

User types the values for the test mean of each variable - Option 1 
was requested. 


♦VRNC 
VAR01 =4 . 1 
VAR02 = 4 .2 
VAR03=4 .3 
VAR04 = 4 .4 
VAR05=3 .5 


♦ 0 .K. 


User types the values of the test variances of each variable - option 2 
was requested. 


Processing will now begin. 


Carriage returns typed to terminate lists. 


DESCRIPTIVE STATISTICS 


BMD ^ Name of file being analyzed. 

NO. OBS. = 68 


VARIABLE MEAN 

NO. NAME 


VARIANCE 


STANDARD 

DEVIATION 


STANDARD 

ERROR 


1 ONEBM 0.69956E+01 

2 TWOBM 0.15250E+02 

3 TREBM 0 . 1 0425E+02 

4 QRTBM 0.30996E+01 

5 FIVBM 0.25397E+02 


0.41909E+02 0.6473 
0.87563E+02 0.9357 
0 . 1 351 9E+03 0.1 1 62 
0.35966E+02 0.5997 
0.15574E+03 0.1247 


BE+ 

01 

0 . 

. 7B506E+ 

00 

5E+ 

01 

0 . 

■ 1 1 3 48E+ 

01 

7E+ 

02 

0 . 

. 1 4100E+ 

0 1 

1 E+ 

0 1 

0 • 

. 72726E + 

00 

9E+ 

02 

0 . 

. 1 51 34E+ 

01 
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SKEWNESS KURTOSIS MAX MIN RANGE 

1 0.1500RE+01 0.21928E+01 0.30000E+0? 0.25000E+00 0.29750E+02 

2 0.57906E+00 ~ 0 = 800 53 £■!• 0 0 0-3P000E+02 0iP2000R + 0l pi ^ 3 rj. 0p 

3 0.89081E+00 -0.72163E+00 0.38000E+02 0.40000E+00 0.37600E+02 

4 0.62546E+01 0.43371E+02 0.48000E+02 0.10000E-01 0.47990E+02 

5 0.43154E+00 -0.85983E+00 0.5R000E+02 0.50000E+01 0.53000E+02 


CORRELATION MATRIX 

QRTRM 4 FI VBM 5 


1 .000000 

-0 . 1 40437 1 .000000 

The name and subscript of the third variable in the 
analysis. 


DATAl 

NO. ORS. = 25 

VARIABLE MEAN VARIANCE STANDARD STANDARD 

NO. NAME DEVIATION ERROR 

1 ONE0] 0.45600E+02 0.12417E+02 0.35237E+01 0.70475E+00 

2 TWO0] 0.31300E+03 0.14R40E+05 0.12182E+03 0.24364E+02 

3 TRE01 0.24136E+03 0.15286E+04 0.39097E+02 0.78194E+01 

4 QRT01 0.10868E+03 0.32631E+03 0.18064E+02 0.36128E+01 

5 FIV01 0.37440E+02 0.23059E+03 0.15185E+02 0.30370E+01 


SKEWNESS KURTOSIS MAX MIN RANGE 

1 -0.46517E+00 0.12996E+01 0.51000E+02 0.35000E+02 0.16000E+02 

2 0.68285E+00 0.10441E+01 0.67700E+03 0.13900E+03 0.53B00E+03 

3 0.14744E+01 0.18508E+01 0.35700E+03 0.19800E+03 0.15900E+03 

4 0.46925E + 00 -0.84454E + 00 0.14700E+03 0..79000E+02 0.68000E+02 

5 0.33407E+00 -0.13492E+01 0.64000E+02 0.15000E+02 0.49000E+02 


CORRELATION MATRIX 

ONE01 1 TWO01 2 TRE01 3 QRT01 4 FIV01 5 

1 ONE01 1.000000 

2 TWO01 0.01155] 1.000000 

3 TRE01 -0.291373 -0.179528 1.000000 

4 QRT01 0.352039 -0.029065 -0.460303 1.000000 

5 FIV01 0.310231 -0.164700 -0.333221 0.938052 1.000000 
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DATA? 


NO- OBS. = ID 


VAR I ABLE 
NO. NAME 

MEAN 

VAR I ANCE 

STANDARD 

DEVIATION 

STANDARD 

ERROR 

1 ONE02 

2 TWO02 

3 TRE02 

4 QRT02 

5 FIV02 

0 . 7R000E+0 1 
0.51 500E+0? 

0 . 1 2400E+0? 

0 . P5R00E+0? 

0 . 9R340E+0P 

0 . 39956E+0? 

0 . P0R2RE+03 

0 . 47RP2E+02 

0 . 1 R907E+03 

0 .202 1 4E+03 

0 . 632 1 0E+0 1 

0 . 1 4432E+02 

0 . 69 1 54E+0 1 

0 . 1 3750E+0? 
0.1421 RE+02 

0 . 1 99R9E+0 1 
0 . 45637E+0 1 
0 .2 1 R6RE+0 1 
0 . 43482E+0 1 
0 .44961 E+0 1 


SKEWNESS KI.JBT0.STS 


MAX 


MIN 


RANGE 


1 0.5E356E+DD 
? -0.1P735E+00 
3 D.376/i3E+D0 
A 0.12S9PE+D0 
5 -0.35647E+00 


-0 . 7D5 1 1 E+OD 
-0.1 53f^0E + O 1 
-0 . 1 67R5E+D 1 
-0 . 1 A73RE+0 I 
-0 . 1 33E6E+D 1 


0 . P 1 000E+0P 
0 . 71 000E+0P 
0 . P300nE+0P 
0 . 47000E+02 
0.11 590E+03 


0 ) . ] 0(7|f;i0£+0 ] 

0.31 000E+0? 
(7) . 40000 E+0 1 
0 . 60000 E+0 1 
0 . 72500E+0P 


0 . P0000E+0P 
0 . 40000E+0P 
0 . 1 9000E+0P 
0.41 000E+02 
0 . 43400E+0P 


CORRELATION MATRIX 




ONE0? 1 

TWO0? 2 

TRE02 3 

QRT02 4 

FIV0? 5 

1 

ONE0 2 

1 .000000 





2 

TWO02 

0 .095004 

1 .000000 




3 

TRE0? 

-0 . R7237 1 

-0 .2493R4 

1 .000000 



4 

QRT0? 

-0 . 107R96 

-0 . 972027 

0.150505 

1 .000000 


5 

FI V0? 

0.729910 

0 . 72R97R 

-0 . 71 4R46 

-0 . 737734 

^ 1 .(75 9(71000 


OPTION 1 
VAR USER MEAN 

RMD 



1 

2 

3 

4 

5 


Answers to 


0 . 1 600 E+0 1 
0 . 1 700E + 0 1 
0 . 1 R00E+0 1 
0 .P300E+0 1 


0 .P500E+01 

/ 

Question 4 


0 . 6R73E+01 
0.11 94E+0P 
0 . 61 1 7E+0 1 
0 . 1 099E+0 1 
0.151 3E+0P 

t 


0 . 6P43E+0P 
0 . 1 27RE+0P 
0 .3064E+0P 
0 . P945E+0P 
0.11 50E+0P 

J 


0 . 3 1 0PE+0 1 
0 . 1 09 1 E+0? 

0 . 4R47E+0 1 
0 . 5405E+0 1 
0 . ? 1 3?E+0? 

t 

— J t-values to be compared with 

values obtained from t-table. 


OPTION 2 


VAR USER VARNC 


RMD 


67 DATAl 24 DATA? 9 


1 0.4100E+01 

? 0.4200E+01 

3 0.4300E+01 

4 0.4400E+01 

5 0.3500E+01 
Answers to Question 5_^ 


0 . 6R49E+03 
0 . 1 397E+04 
0.21 06E+04 
0 . 5477E+03 
P 29R1E+04 



0 . 726RE+0? 
0 . R4R0E+0 5 
0 . R532E+04 
0 . 1 7R0E+04 
0 . .1 5R 1 E + 0 4 


0 . R77 1 E+02 
0 . 4463E+03 
0 . 1 00 1 E+03 
0 . 3R67E+03 
0.51 9RE+03 


¥ chi-square values to be compared with values obtained 
from chi-square table. 
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O'^TIOiX' 3 


VAR = 1 


HMD 67 
nATA] RA 
OAT AS 9 


HMO 


67 OATAl P4 data? 9 


I' I . OriPD £+00 
0 . ?P? 5F;+0 2 
0 . 3679E+00 


(-0 .9B95£+0!:^"^-0 . 3679E+09i 

pi.nOOP!E+0 0 n.9963E+00 

-0-2963E+P19 9.pi000E+00 


t-statistic comparing variable 1 in BMD and DATAl I 


VAR = 9 


RMD 67 
DATAl ?4 
OATAR 9 


RMD 67 

0 .0000E+00 
0 -RO 1 PE+0? 
0.1061 E+0? 


OATAl 94 

-0 .90 1 RE+09 
0 .0000E+00 
- 0 . 6 7 1 0 E+ 0 1 


DATA? 9 

-0 . 1 0 61 E+09 
0 . 671 0 E+0 1 
0 . 00 00 E+ 00 


VAR = 3 


RMD 67 DATAl 94 DATA9 9 


BMD 6 7 
DATAl 94 
DATA9 9 


0 .0000E+00 
0 . 4404E+09 

0.591 9E+00 


- 0 . 440 4E+09 
0 .0000E+00 
-0 . 1 89 5E+09 


-0 . 59 1 9E+00 
0 . 1 895E+09 
0 . 0000 E+00 


VAR = 4 


BMD 67 DATAl 94 DATAR 9 


BMD 67 
DATAl 94 
DATA9 9 


0 .0000E+00 
0 . 4955E+09 
0.91 13E+01 


-0 . 4955E+09 
0 .0000E+00 
-0 . 1 303E+09 


-0.91 1 3 E+0 1 
0 . 1 30 3 E+09 
0 . 0000E+00 


VAR = 5 


BMD 67 

DATAl 94 

DATA9 9 


RMD 6 7 

0 .0000E+00 
0 . 38P7E+0 1 
0 . 1 696E+09 


DATAl 94 

-0 .3887E+0 1 

0 .0000 E+00 

0 . 1090E+09 


DATA9 9 

-0 . 1 696E+09 
-0 . 1 090E+09 
0 .0000E+00 
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OPTION 4 


VAR = 1 


BMD 67 DATAl ?4 DATA2 9 

BMD 67 O.1000E+01 0.3375E+01 0.1049E+01 
OATAI 24 0.2963E+00 0.1000E+01 0.310BE+00 
DATA2 9 0.9534E+00 0.3218E+01 0.1000E+01 



BMD 67 DATAl 24 DATA2 9 


BMD 67 
DATAl 24 
DATA2 9 


0 . 1 000E+0 1 
0.1131 E+02 
0 .3537E+00 


0 .R844E-0 1 
0 . 1 000E+0 1 
0.3129E-01 


0.2827E+01 
0.31 96E+02 
0 . 1 000E+0 1 


VAR = 4 


RMD 67 DATAl 24 DATAB 9 

RMO 47 0.1000E+01 0.1102E+00 0.1902E+00 
OATAI 24 0.9073E+01 0.1000E+01 0.1726E+01 
DATA2 9 0.5257E+01 0.5794E+00 0.1000E+01 


VAR = 5 


BMD 67 DATAl 24 DATAB 9 

8Mn 67 0.1000E+01 0.6754E+00 0.7704E+00 
DATAl 24 0.1481E+01 0.1000E+01 0.1141E+01 
DATA2 9 0.1298E+01 0.R766E+00 0.1000E+01 
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OPTION 5 


VAR F-VALUE 

1 O.4252E+03 

2 0.2429E+03 

(3 0 « 1 1 42E+0 4) ^ F-cfatictif' comparing variable 3 in all files of the analysis. 

4 0.9504E+03 

5 0.2280E+03 


VI = 2 

V2 = 100 


Degrees of freedom parameters. 


OPTION 6 

VAR UNCORRECTEO CORRECTED 


1 0«i062E+02 

2 0.2417E+03 

3 0.7398E+02 

4 0 . 52 1 5E+02 

5 0.1527E+01 


0 • 9609E+0 1 
0.21 87E+03 
0 . 6694E+02 
0.471 9E+02 
0 . 1 3R1 E+0 1 


K - 1 = 2 

C = 0.1105E+0 1^ — Correction factor 


4.3 STPRG EXAMPLE 

The following example illustrates the STPRG module of ST ATP AC including all options and plots. The previously 
presented BMD data is analyzed with variable 6 as the dependent variable. STPRG performs four steps and stops after 
including independent variables 1,3»4, and 5 in the regression model. Variable 2 was not entered into the model be- 
cause the value of F (ENTER) never exceeded the value of F-IN (=0.5). 


=t=PR0G 

STPRG 

♦ ^♦WARNING - COMNON SIZE DIFFERS^** This message should be ignored. 
*F I LE User types the name of the file to be analyzed. 


BMD 

+VARS User types the variables to be considered for the model. The last 

0 1 variable listed is used as the dependent variable. 

04 

05 
02 
03 

06 

*F I N User types the F-value to determine entry. 

. 5 

*F 01 JT User types the F-value to determine exit. 

.3 

* L I M User types the maximum number of iterations. 

06 

* T 0 L User requests the default tolerance (0.00 1 ). 

+ 0PTS User requests all options. 

1111 

*C OEF User supplies the test coefficients for option 2. 

COF01 =1.1 
COF02 = 1 .2 
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COF03 = 1 .3 
COF05 = 1 .4 
COF04=1 .5 
COF02 = 1 .2678 

User types simple carriage return. 

*FCTR User supplies the value for t(N-p,l-*/ia). 

2 .0 

♦P LT S User requests all plots. 

1 1 1 1 

♦0 • K . Processing will now begin. 


s£G2E.':i.S J ON !)lJ'l'2i,JT (.ST^’^sM:■) Stepwise regression 


DATA FILE Name of file (Question 1). 

NO. ORS. 6P! 

RESP. 6 SIXPM ^ Subscript and name of dependent variable. 

Response to Question 6. 

Responses to Questions 3 and 4. 


VARIABLE CORR. X..VS.Y 

Response to Question 2. 

.0820 56 
0 . 74961 7 
0.786058 
0 .342568 
0.645167 

Responses to Question 5 of INPUT. 


2 

3 

4 

5 

W 


ONEBM 
TWOBM 
TREBM 
ORTBM 
FI VBM 

4 


TOL. 0.00100^ 
F-IN 0 . 50000 '! 
F-OUT 0.30000] 


STEP NO. 1 


VAR. ENTERING 3 TREBM 
SEQ.F-TEST 106.724 
DEGREES OF FREEDOM 66 
CHANGE IN R-SQ 0.617888 



R-SO 

STD. ERR. Y 


0.61 
27 . 


7888 

1238 



Subscript and name of first variable in model 
F-test to determine entry (106.724 > 0.5) 

Change in multiple correlation coefficient 
New multiple correlation coefficient 
Standard error of dependent variable. 


A N 0 V A ^ Analysis of V ariance 


SOURCE 0. 

F. SUM OF SQUARES 

MEAN SQUARE 

OVERALL F 

TOTAL 

REGRS . 

67 0.127073E+06 

1 0.785169E+05 

0.7851 69E+05 

0 . 1 0 6724 E + 03 

RES ID. 

66 0.485562E+05 

0 . 735700E+03 


\R. IN REG. 

VARIABLE 

COEFFTC I ENT 

STi). PIRROR 

P (REMOVE) 

3 TREBM 

0.294426E+01 0 

. 28 5000 E+ 00 

T . 1 06724E+03«^ 


F-value needed to remove this variable from 
the present regression model 

B0 = 2 6.099 8 ^ Constant term of the model 




4-10 



VAR. NOT IN REG 


VARIABLE 

TOLERANCE 

PARTIAL CORR. 

F (ENTER) 

1 ONEBM 

P;, 9^9 974 

0 . 1 262 1 6 . 

0 . 1 05225E+0 1 

2 TWOBM 

0 . 246590 

0.21 9325 

0 . 32 8 4 73 E+0 1 

4 ORTBM 

0 . 984 1 58 

0.397285 

0 . 1 2 1 82 1 E+02 

5 FIVBM 

0 . 434591 

A 

0 . 1 32 7 60 

r-i . 1 1 662 1 E+0 1 


ta- 

u 

Partial correlation 
with y 

F-values needed to enter these 
variables in the present regression 
model 


VAR. ENTERING A Q R T R M Variable 4 enters regression model 

SEQ. F-TEST IR.lRRl 

DEGREES OF FREEDOM 65 
CHANGE IN R-SO D. 0603 11 
R-SO 0.678199 

STD. ERR. Y 25.0R91 


ANOVA 

SOURCE 

TOTAL 
REGRS . 
RES ID. 


D.F. SUM OF SQUARES 

67 0.1P7073E+O6 

R 0.861R0BE+05 

6b 0.408993E+05 


MEAN SQUARE , OVERALL F 

n.430904E+05 0 . 6R4940E+02 

0 . 629 1 1 2E+03 


VAR. IN REG. 


VARIABLE 

3 TREBM 

4 ORTBM 


COEFFICIENT 

0 .2R2755E+0 1 
0 . 1 79768E+0 1 


STD. ERROR 

0 . 265660E+0n 
0.51 5052E+00 


F (REMOVE) 

0.11 3284E+03 
0 . 1 2 1 82 1 E+02 


BO = 21. 7445 


VAR. NOT IN REG. 
VARIABLE 

1 ONEBM 

2 TWOBM 
5 FIVBM 


TOLERANCE 

0 . 933987 
0.246516 
0.378439 


PARTIAL CORR. 

0.027242 
0.246529 
0.321 789 


F (ENTER) 


0 .475306E-O1 
. 4 1 41 41 E+0 1 
0 . 739256E+0 1 


STEP NO. 3 

VAR. ENTERING 5 FIVBM ^ Variable 5 enters the regression model 

SEQ. F-TEST 7.39256 

DEGREES OF FREEDOM 64 
CHANGE IN R-SQ 0.033322 
R-SQ 0.711521 

STD. ERR. Y 23.9328 


ANOVA 


SOURCE 

D.F. 

SUM OF SQUARES 

MEAN SQUARE 

OVERALL F 

TOTAL 

67 

0 . 127073E+06 



REGRS . 

3 

0 . 9041 51 E+05 

0.301384E+05 

0 . 5261 76E+02 

RESID. 

64 

0 . 36658OE+05 

0 . 572781 E+03 
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VAR. IN REG 


VARIABLE 

COEFFICIENT 

STD. ERROR 

F (REMOVE) 

3 

TREBM 

0 . 1 95840E+01 

0 . 407974E+00 

0 .230429E+02 

4 

ORTBM 

0.231239E+01 

0.526652E+00 

0 . 1 92786E+02 

5 

FIVBM 

0. 103553E+01 

0 . 38O860E+OO 

0 . 739256E+01 


2.91072 





NOT IN REG. 




VARIABLE 

TOLERANCE 

PARTIAL CORR. 

F (ENTER) 

1 ONEBM 

0.8831 69 

0.111114 

0 . 787547E+00 

2 TWOBM 

0.11 3447 

0.015737 

0 . 1 56066E-0 1 


STEP NO. 4 ^ Last step - no more variables to be entered or removed 


VAR. ENTERING 1 ONEBM>^ 

SEQ. F-TEST 0.7R7547 

DEGREES OF FREEDOM 63 


CHANGE IN R-SQ 
R-SQ 

STD. ERR. Y 


0.003562 
0.71 50R2 
23 .9727 


Variable 1 enters regression model 


ANOVA 


SOURCE 

D . F . 

SUM OF SQUARES 

MEAN SQUARE 

OVERALL F 

TOTAL 

67 

n . 1 2 7073E + 06 



REGRS . 

4 

0 . 908677E+05 

0.2271 69E+05 

0 .395291 E+08 

RES ID. 

63 

0 . 362054E+05 

0 . 574689E+03 



All values are greater than F-OUT (0.3) 

F (REMOVE) 

0 . 7R754 7E+00.^ 


VAR. IN REG. 
VARIABLE 

1 ONEBM 

3 TREBM 

4 ORTBM 

5 FIVBM 


COEFFIC I ENT 

rj . 4R 720RE+00 
0 . 1 R9677E+0 1 
0 .223335E+01 
0.111 674E+0 1 


STD. ERROR 

O . 4R 1 394E+00 
0.414512E+00 
0 . 534995E+00 
0 . 39231 6E+00 


0 . 2093R9E+02 
0 . 1 74266E+02 
0 . 8 1 0276E+0 1 


BO = -1.25284 

VAR. NOT IN REG. 


Less than F-IN (0.5) — i 


VARIABLE 


TOLERANCE PARTIAL CORR. 


F (ENTER) 


2 TWOBM 


0 . 1 02908 


O .0'52408 


0 . 1 70755E+00-< 
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RESIDUAL OUTPUT 


D. 

OPS 

PRED 

RES ID 

1 

0 . 640000E+02 

0 . 885535E+02 

-0 .245535E+02 

R 

0 . 650000E+02 

0 . R62786E+02 

-0.212786E+02 

3 

0 . 820000E+02 

0-887180E+02 

-0.671 796E+0 1 

4 

0 . 230000E+02 

0.22731 7E+02 

0.26R293E+00 

5 

0 . 640000E+02 

0 . 849735E+02 

-0 .209735E+02 

6 

0 . 1 60000E+02 

0.226273E+02 

-0 . 662731 E+01 

7 

0 . 1 20000E+02 

0.292135E+02 

-0. 1 72135E+02 

8 

0 . 270000E+02 

0 .262787E+02 

0 . 72 1 302E+00 

9 

0 . 480000E+02 

0 .28991 9E+02 

0. 1 90081 E+02 

10 

0 . 500000E+02 

0.41 8836E+02 

0 • 81 1 641 E+01 

1 1 

0 • 120000E+02 

0.21 5410E+02 

-0 . 9541 02E+0 1 

1? 

0 . 1 30000E+02 

0.353019E+02 

-0.2230 1 9E+02 

1 3 

0 . 200000E+02 

0.320935E+02 

-0. 120935E+02 

1 4 

0 . 230000E+02 

0.27 1830 E+02 

-0.41 8304E+0 1 

1 5 

0.11 8000E+03 

0 . 947300E+02 

0 .232700E+02 

i 6 

0 . 500000 E+02 

0. 50351 9E+ 02 

-0 . 35 1 9 1 4E+00 

1 7 

0 . 630000E+02 

0 . 564708E+02 

0 . 652920E+0 1 

1 8 

0 . 1 50000E+03 

0 . 829056E+02 

0 . 670944E+02 

1 9 

0 . 720000E+02 

0.200252E+02 

0.51 9748E+02 

20 

0 • 540000 E+02 

0 .420349E+02 

0 . 1 1 9651 E+02 

21 

0 . 109000E+03 

0 . 781 895E+02 

0 .3081 05E+02 

22 

0 . 100000E+02 

0.237146E+02 

-0.1371 46E+02 

23 

0 • 125000E+03 

0. 121368E+03 

0.3631 78E+01 

24 

0.440000E+02 

0.2821 55E+02 

0 . 1 57845E+02 

25 

0 . 480000E+02 

0 . 4391 75E+02 

0 . 408250E+0 1 

26 

0 . 105000E+03 

0.11 9610E+03 

-0 . 1 461 04E+02 

27 

0 . 900000E+01 

0 .258037E+02 

-0 . 1 68037E+02 

28 

0 . 1 30000E‘+03 

0 . 103836E+03 

0.261 638E+02 

29 

0 . 1 60000E+03 

0. 1241 85E+03 

0.3581 53E+02 

30 

0 • 480000E+02 

0.499287E+02 

-0 . 1 92869E+0 1 

31 

0 . 360000E+02 

0.248910E+02 

0.11 1090E+02 

32 

0 . 1 50000E+03 

0 .883400E+02 

0 . 61 6600E+02 

33 

0 . 780000E+02 

0.3121 97E+02 

0 . 467803E+02 

34 

0 .230000E+02 

0 .251065E+02 

-0 .2 1 0652E+0 1 

35 

0 . 420000E+02 

0 .258751 E+02 

0 . 1 61 249E+02 

36 

0 . 720000E+02 

0. 7851 57E+02 

-0 . 651 573E+0 1 

37 

0 .200000E+02 

0.265577E+02 

-0 . 655773E+01 

38 

0 . 360000E+02 

0.339123E+02 

0 .208769E+0 1 

39 

0 . 560000E+02 

0.467423E+02 

0 . 925773E+0 1 

40 

0 . 360000E+02 

0.410371E+02 

-0 . 50371 1 E+01 

41 

0 • 260000E+02 

0.391 791 E+02 

-0.131791E+02 

42 

0 . 108000E+03 

0. 109324E+03 

-0. 132396E+01 

43 

0 • 106000E+03 

0.11 3005E+03 

-0 . 700476E+01 

44 

0 • 1 60000E+02 

0 .231 520E+02 

-0.715201E+01 

45 

0 . 104000E+03 

0.11 9008E+03 

-0 . 1 50084E+02 

46 

0 . 470000E+02 

0 . 476452E+02 

-0 . 6451 88E+00 

47 

0 .270000E+02 

0 .228350E+02 

0.41 6496E+01 

48 

0 . 120000E+02 

0.412560E+02 

-0 .292560E+02 

49 

0 . 700000E+0 1 

0.21 7236E+02 

-0 . 1 47236E+02 

50 

0 . 1 80000E+02 

0.302658E+02 

-0. 122658E+02 

51 

0 .280000E+02 

0.369663E+02 

-0.896633E+01 

52 

0 .250000E+02 

0.308700E+02 

-0.586995E+01 

53 

0. 1 10000E+02 

0 .;343047E+02 

-0 . 1 33047E+02 

54 

0 .200000E+02 

0 .396499E+02 

-0 . 1 96499E+02 

55 

0 . 1 40000E+02 

0.31 701 3E+02 

-0 . 1 770 1 3E+02 

56 

0 • 380000E+02 

0 . 61 4628E+02 

-0 .234628E+02 
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57 

0 . 103000E+03 

0.531 120E+02 

0 . 498880E+02 

58 

0 . 106000E+03 

0.89931 4E+02 

0 . 1 60686E+02 

59 

0 . 630000E+02 

0 .998496E+02 

-0 . 368496E+02 

60 

0 . 208000E+03 

0.201 456E+03 

0 . 654361 E+0 1 

61 

0 . 320000E+02 

0 . 671 833E+02 

-0 . 351 833E + 02 

62 

0 .280000E+02 

0 .41 5379E+02 

-0 . 1 35379E+02 

63 

0 . 320000E+02 

0. 420629E+02 

-0 . 1 00629E+02 

64 

0 . 100000E+03 

0 . 888450E+02 

0.1 1 1 550E + 02 

65 

0 . 500000E+02 

0 . 428304E+02 

0.71 6963E+0 1 

66 

0 • 800000E+02 

0. 51 9009E+02 

0.28099 1 E+02 

67 

0 . 650000E+02 

0 . 1 34051 E+03 

-0 . 690510E+02 

68 

0 .250000E+02 

0.330358E+02 

-0 . 803584E+0 1 


OPTION 1 


VARIABLE 


T-VALUE 


1 ONEBM 0.887438E+O0 

3 TREBM 0.457590E+01 

4 ORTBM 0.417451E+01 

5 FIVBM 0.284654E+01 


Note that t-statistics are computed 
only for variables included in the 
model 


OPTION 2 


VARIABLE 



COEFF. 


_ Response to Question 8 
T-VALUE 


1 ONEBM 

3 TREBM 

4 QRTBM 

5 FIVBM 


0 . 1 10000E+01 
0 . 130000E+01 
0 . 1 50000E+01 
0 . 1 40000E+01 


-0 . 139759E+0 1 
0 . 1 43968E+0 1 
0 . 1 37075E + 0 1 
-0 . 7220 1 7E+00 


OPTION 3 


T( N-P t 1 -ALPHA/2 ) - 2.00000^ Response to Question 9 (FCTR) 

VARIABLE LOWER BOUND UPPER BOUND 


1 ONEBM -0 . 535581 E+00 

3 TREBM 0.106774E+01 

4 QRTBM 0.116335E+01 

5 FIVBM 0.332109E+00 


0 . 1 39000E+0 1 
0 .272579E+0 1 
0.330334E+01 
0. ]90137E+0i 


N-P 


63 ^ 


Degrees of freedom 



PLOT 1 


YPRED (X) VS RESID (Y) 


Horizontal . 


.671 E+02 . 
. 626E+02. 
.580E+02. 
.535E+02. 
.489E + 02 . 
.444E+02 . 
.399E+02 . 
.353E+02 . 
.308E+02 . 
.263E+02 . 
.21 7E+02 . 

. 1 72E+02 . 

. 126E+02 . 
.810E+01 . 
.356E+01 . 

. 978E+00 . 
.552E+01 . 
.101 E+02 


1 1 


1 1 1 


1 1 


3 2 2 


2 1 
2 


1 1 


1 1 


2 2 


1 46E+02 
1 91 E+02 . 
237E + 02 . 
282E + 02 . 
327E+02 . 
373E+02 . 
41 8E+02 . 
464E+02 . 
509E+02 . 


2 2 


1 1 


^ (X) VS F 


■Vertical 


1 1 


Two observations with same 
-value of y predicted and 
equal residuals 


"0 « 554E + 02 • 


-0 . 600E + 02 . 


-0 . 645E+02 . 


1 


-0 . 691 E + 02 

+ + + + 


+ 


+ 


+ 


0 = 200 E+02 

0 . ROSE+PiR 

0 . 1 41 E+03 

0.201 E+03 


0 . 503E+02 

0.111 E+03 

0.171 E+03 
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'LOT R 


0 .208E+03 . 
0.201E+03. 
0 . 1 95E+03 . 
0 • 1 88E + 03 . 
0- 181E+03. 
0 . 1 74E+03 . 
0. 168E+03. 
0.161 E+03 . 
0 . 1 54E+03 . 
0 . 1 48E+03 . 
0 . 1 41 E+03 . 
0 . 1 34E+03 . 
0 . 128E + 03 . 
0 . 121 E+03 . 
0.11 4E+03 . 
0. 107E+03. 
0. 101E+03. 
0 . 941 E+02 . 


YPRED (X) VS yOBS (Y) 


Horizontal 


Vertical 


1 


1 1 


1 


1 


1 


1 


1 


1 1 — Two observations with same 

value of y predicted and y 
1 observed. 


0 .874E+02 . 
0 .807E+02 . 
0 . 740E+02 . 
0 . 673E+02 • 
0 • 60 6E+02 • 
0.539E+02. 
0 .472E+02 . 
0 • 40 5E + 02 * 
0 .338E+02 . 
0.271E+02. 
0 .204E+02 . 
0.I37E+02. 
0.700E+01 . 


1 1 


1 1 
3 1 1 

1 

1 1 


1 2 
3 3 11 
2 4 1 

5 111 


2 1 


0 .200E+02 

0 • 805E+02 

0 . 1 41 E+03 

0 .201 E+03 


0 . 503E+02 

0.111 E+03 

0. 1 71 E+03 
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PLOT 3 


TIME VS RESID 

i A 


Vertical 


Horizontal 


-0 . 691 E+02 


-0.230E+02 


0.230E+02 


0 . 691 E+02 


-0 .460E+02 
+ + 


-0. 71 5E-06 
+ + 


+ 


0 . 460E+02 

+ 


+ 


1 

2 

3 

A 

5 

6 

7 

8 
9 

10 

1 1 
12 
1 3 
1 4 
1 5 
1 6 
1 7 
1 8 
1 9 
20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 


38 

39 

40 

41 

42 

43 

44 

45 

46 

47 

48 

49 

50 

51 

52 

53 

54 

55 

56 


* 


* 


* 


* 


* 


* 


Observation ordinal is used as the 
measure of time. Plot should show 
uniform scatter with no discemable 
pattern 


* 


* 


* 
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57 

58 

59 

60 
6 1 
62 

63 

64 

65 

66 

67 * 

68 




* 


* 


* 


* 


* 


=(= 






PLOT 4 


RPSID HISTOGRAM 


1 0 Number of observations with a 

9 given residual value 

8 
7 
6 
5 
4 
3 
2 

1 1 


1 
1 

1 1 
1 1 

11111 
11111 
11111111 
11111111 
111111111 
111111111111 


1 

111 1 
11111 11 11 


+ 


+ 


+ . + 


+ 


+ 


-0.691 E+02 


-0 .230E+02 


0 .230E+02 


0. 691 E+02 


-0 . •460E+02 


-0.71 5E-06 


0.460E+02 


Value of the residual 


4.4 MLTRG EXAMPLE 

The following sample output is the result of analysis of the DATAl file with the MLTRG module of ST ATP AC. The 
output does not include options and plots, as the MLTRG options and plots are the same as those of STPRG previously 
presented. 


*PROG 

MLTRG 

♦ ♦♦WARNING - COMMON SIZE DIFFERS*** This message should be ignored 

*F ILE 


DATAl 

*VARS 


06 

Last variable typed is the dependent variable (all variables in the file 

*TOL 

will be entered in the regression model) 

.00103 


+OPTS 


1 001 

Only options 1 and 4 are requested 

*PLTS 


0111 

Only plots 2, 3 and 4 are requested 

*0 .K. 



4-18 



REGRESSION OUTPUT CMLTRG) 


OAT A FILE OATAl 
NO. ORS. R5 

R E S P • 6 S I X 0 1 Dependent variables 

TOL. 0.00103 


VARIABLE CORR. X.VS.Y 

1 ONEOl 0.06i=?706'^ 

R TWOOl 0.46R219 I 

3 TREO 1 0.111 343 ) 

4 QRTO! 0.30R764 I 

5 FIVOl 0.R90697 J 


R-SQ 0.494342 

STD . ERR . Y 1.1 5539 


ANOVA 

SOURCE D.F. SUM OF SQUARES MEAN SQUARE OVERALL F 

TOTAL 24 0.501 600E+02 

REGRS. 5 0.247962E+02 0.495924E+01 0.371496E+01 

RESID. 19 0.25363RE+02 0.133494E+01 


VAR. IN REG. 

VARIABLE COEFFICIENT STD. ERROR 

1 ONEOl 0.430B«4E-02 O.724055E-O1 

2 TWOOl 0.722737E-02 0.2121R3E-O2 

3 TREOl 0.158192E-01 0.72B874E-02 

4 QRTOl 0.148276E-01 O.444280E-O1 

5 FIVOI O.339402E-O1 0.505993E-01 


BO = -6.71895 


All independent variables are included 
in the model 
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APPENDIX A 

DESCRIPTIVE STATISTICS ALGORITHMS 


Xj-: mean of the variable in the i*^^ 


file. 



Z ^im 

m=l 


/Ni 


Oji : variance of the variable in the i'^^ file. 



^im 



/(Nj-l) 


tVi tin 

Ojj: standard deviation of the variable in the file. 


aji = V^ 


tin tVi 

S.E.j-: standard error of the mean of the j variable in the i^^ file. 


S.E^i = a.i/VNr 


SKEWNESSjj: coefficient of skewness of the variable in the i^^ file. 


N, 


Z (Xjinr-Xji)^) /Ni 

_\m=l 


SKEWNESSji = 




A-1 



KURTOSISj^: coefficient ofkurtosis of the variable in the i^^ file. 


KURTOSISji 



Cj.^: simple correlation coefficient between the and variable in the i^^ file. 


Z (Xrim-Xri)(Xsim-Xsi) 
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APPENDIX B 

REGRESSION ANALYSIS ALGORITHMS 


corr (x-,y): correlation of the dependent variable with the independent variable. 


corr (xj,y) 


N 

Z (Xim-Xi)(ym-y) 

m=l 


'N /N 

m=l V m=l 


O 

r : multiple correlation. 


N 


T :^2 


Z (Vi-y) 


j.2= i=l 


= 1 - a.. 


N 

Z (yj-y)^ 

i=l 


Syi standard error of y. 



(ym-y)^ 


Vann/(N-l-p) 
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Sequential F-test (Entering). 

F = {VmaxW-l)}/(a„„-V^3^) 

"Fere = maximum Vj. Vj = aj^ a^/aj; 

a-j = elements of the correlation matrix 

n = total number of variables being analyzed 
0 = degrees of freedom (N-l-p) 

Sequential F-test (Leaving) 

F’^fdVminD-^ /Sn 

where = minimum Vj, V- = a^^^ a^^j/ajj 

and other symbols as above. 


ANOVA Analysis of Variance Table 


D.F. (Degrees of Freedom) 

Total 

N- 1 


Regression 

P 


Residual 

N-l-p 

Sum of Squares 

Total 

N 

ssy= Z (ym-y)^ 

m=l 


Regression 

^^reg “ ^^y ' ^nn^ 


Residual 

^^resid ~ ^^y" ^^reg 

Mean Square 

Regression 

^^reg ~ ^^reg'^P 


Residual 

^^resid "" ^^resid^^^'^'P^ 

Overall F 

Table of Variables in Regression 

4.U 

bji coefficient of the i“‘ variable. 

h - h 
^i ^in a- 

Regression 

mSreg/mSresid 

f Vi f Vi 

where bj^^ = i*^^ element of last (n*^^) column of inverted correlation matrix 


a 


n 



(ym-y)^ 



2 
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Sj : standard error of b-. 



where Sy is standard error of y; 

Oj is defined as above; 

bj- is diagonal element of the correlation matrix. 

fh 

F^: F-test to remove the i^^ variable. 



bg : constant term of model. 

T 

bo = y- £ bjXj 
i=l 

where T = number of variables in regression 

Table of Variables not in Regression 
tol| : tolerance of i^^ variable. 

tol- = a” (the i^^ diagonal element of the inverted correlation matrix) 

Partial correlation of the i^^ variable. 

part, con.; = 

fVi 

F- : F-test to enter the i^^ variable. 

^i~{^in ■ ^^l^^^ii ^nn ■ ^in^ 

where 0 = N - 1 - p 
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Residual Output 

j : predicted value of the dependent variable for the i^^ observation. 
T 

Pi = bo+ IbjX-i 

j=i 

\vhere T = number of variables in regression. 

fVi 

e- : residual for the i^^ observation. 

e- = y- - y- 
1 -^1 
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HOW TO OBTAIN SOFTWARE INFORMATION 


Announcements for new and revised software, as well as programming notes, software 
problems, and documentation corrections are published monthly by Software Informa- 
tion Service in the following newsletters. 

Digital Software News for the PDP-8 Family 
Digital Software News for the PDP-9 Family 
Digital Software News for the PDP-15 Family 

These newsletters contain information applicable to softv/are available from Digital’s 
Program Library (see title page for address). Software products and documents are 
usually shipped only after the Program Library receives a specific request from a user. 

Digital Equipment Computer Users Society (DECUS) maintains a user library and pub- 
lishes a catalog of programs as well as the DECUSCOPE magazine for its members and 
non-members who request it. 

Please complete the card below to receive information on DECUS membership or to place 
your name on the newsletter mailing list. 


Please send 

D DECUS membership information, 
or add my name to the 

□ DECUSCOPE non-membership list. 

And, send me the Digital Software News for the 
□ PDP-8 □ PDP-9 


□ PDP-15 


NAME 

COMPANY 

ADDRESS 

CITY STATE ZIP 
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READER’S COMMENTS 

Digital Equipment Corporation maintains a continuous effort to improve the quality and usefulness of its 
publications. To do this effectively we need user feedback - your critical evaluation of this manual. 

Please comment on this manual’s completeness, accuracy, organization, usability, and readability. 



DEC also strives to keep its customers informed of current DEC software and publications. Thus, the following period- 
ically distributed publications are available upon request. Please check the appropriate box(s) for a current issue of the 
publication(s) desired. 

□ PDP-15 Software Manual Update, a quarterly collection of revisions to current software manuals. 

Q PDP-15 User’s Bookshelf, a bibliography of current software manuals. 

Q Program Library Price List, a list of currently available PDP-15 software programs and manuals. 

Please describe your position 

Name Organization — 
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Street 
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State 


Zip or Country 
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